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Abstract 



Andersson and Nilsson introduced in 1993 a level- compressed trie (in short: LC trie) 
in which a full subtree of a node is compressed to a single node of degree being the 
size of the subtree. Recent experimental results indicated a "dramatic improvement" 
when full subtrees are replaced by "partially filled subtrees" . In this paper, we provide 
a theoretical justification of these experimental results showing, among others, a rather 
moderate improvement of the search time over the original LC tries. For such an 
analysis, we assume that n strings are generated independently by a binary memoryless 
source (a generalization to Markov sources is possible) with p denoting the probability of 
emitting a "1" (and q = 1 —p). We first prove that the so called a-fillup level F n (a) (i.e., 
the largest level in a trie with a fraction of nodes present at this level) is concentrated 
on two values whp (with high probability); either F n {a) = k n or F n {a) = k n + 1 where 
k n = log i n — ol L^if/^l— s ^ > ~ 1 (ct)v / hin + 0(1) is an integer and $(x) denotes the 

~7p? 2 ' n 1 * (1 / y/pq) v ' x ' ° v ' 

normal distribution function. This result directly yields the typical depth (search time) 
D n (a) in the a-LC tries with p ^ 1/2, namely we show that whp D n (a) ~ C\ log log n 
where C\ — 1 / 1 log(l — h/ log( 1 / y/pq) ) | and h — —p log p — q log q is the Shannon entropy 
rate. This should be compared with recently found typical depth in the original LC tries 
which is C2loglogn where Ci — l/|log(l — h/ log(l/ mm{p, 1 — p}))\. In conclusion, 
we observe that a affects only the lower term of the a- fillup level F n (a), and the search 
time in a-LC tries is of the same order as in the original LC tries. 

Key Words: Digital trees, level-compressed tries, partial fillup, probabilistic analysis, 
poissonization. 
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1 Introduction 



Tries and suffix trees are the most popular data structures on words [7] . A trie is a digital 
tree built over, say n, strings (the reader is referred to ^lEJUH] for an in depth discussion 
of digital trees.) A string is stored in an external node of a trie and the path length to 
such a node is the shortest prefix of the string that is not a prefix of any other strings (cf. 
Figure 13). Throughout, we assume a binary alphabet. Then each branching node in a trie 
is a binary node. A special case of a trie structure is a suffix trie (tree) which is a trie built 
over suffixes of a single string. 

Since 1960 tries were used in many computer science applications such as searching 
and sorting, dynamic hashing, conflict resolution algorithms, leader election algorithms, 
IP addresses lookup, coding, polynomial factorization, Lempel-Ziv compression schemes, 
and molecular biology. For example, in the internet IP addresses lookup problem |151 I23| 
one needs a fast algorithm that directs an incoming packet with a given IP address to its 
destination. As a matter of fact, this is the longest matching prefix problem, and standard 
tries are well suited for it. However, the search time is too large. If there are n IP addresses 
in the database, the search time is O(logra), and this is not acceptable. In order to improve 
the search time, Andersson and Nilsson ^ ^] introduced a novel data structure called the 
level compressed trie or in short LC trie (cf. Figure [J). In the LC trie we replace the root 
with a node of degree equal to the size of the largest full subtree emanating from the root 
(the depth of such a subtree is called the fillup level). This is further carried on recursively 
throughout the whole trie (cf. Figure 

Some recent experimental results reported in [HI 1181 II Tj indicated a "dramatic improve- 
ment" in the search time when full subtrees are replaced by "partially fillup subtrees". In 
this paper, we provide a theoretical justification of these experimental results by considering 
a-LC tries in which one replaces a subtree with the last level only a-filled by a node of degree 
equal to the size of such a subtree (and we continue recursively). In order to understand 
theoretically the a-LC trie behavior, we study here the so called a- fillup level F n {a) and 
the typical depth or the search time D n (a). The a-fillup level is the last level in a trie that 
is a-filled, i.e. filled up to a fraction at least a (e.g., in a binary trie level k is a-filled if it 
contains a2 k nodes). The typical depth is the length of a path from the root to a randomly 
selected external node; thus it represents the typical search time. In this paper we analyze 
the a-fillup level and the typical depth in an a-LC trie in a probabilistic framework when 
all strings are generated by a memoryless source with P(l) = p and P(0) = q := 1 — p. 
Among other results, we prove that the a-LC trie shows a rather moderate improvement 
over the original LC tries. We shall quantify this statement below. 

Tries were analyzed over the last thirty years for memoryless and Markov sources (cf. 
[21 El EH C21 EH [ini EOl EH EU). Pittel H3 EOJ found the typical value of the fillup level 
F n (i.e., a = 1) in a trie built over n strings generated by mixing sources; for memoryless 
sources with high probability (whp) 

F P log?* ^ fog"- 

log(l/p m in) h 

where p m i n = min{p, 1 — p} is the smallest probability of generating a symbol and /i_oo = 
log(l/pmin) i s the Renyi entropy of infinite order (cf. [2E])- We let log := log 2 . In the 
above, we write F n ~ a n to denote F n /a n — ► 1 in probability, that is, for any e > we have 
P((l — e)a n < F n < (1 + e)a n ) —* 1 as n — > oo. 
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Figure 1: A trie and its associated full LC trie. 



This was further extended by Devroye {2|, and Knessl and Szpankowski who, among 
other results, proved that the fillup level F n is concentrated on two points k n and k n + 1, 
where k n is an integer 

3— j- (log re — log log log n) + O(l) (1) 

for p ^ 1/2. The depth in regular tries was analyzed by many authors who proved that 
whp the depth is about logn (where h = —plogp — (1 — p) log(l — p) is the Shannon 
entropy rate of the source) and that it is normally distributed when p ^ 1/2 |201 125]. 

The original LC tries were analyzed by Andersson and Nilsson |X] for unbiased memory- 
less source and by Devroye |Hj for memoryless sources (cf . also |22j ) . The typical depth 
(search time) for regular LC tries was only studied recently by Devroye and Szpankowski 
[I] who proved that for memoryless sources with p ^ 1/2 

P log logn 

Dn r i — T\ Til v 2 ) 

-log(l - h/h-oo) 

In this paper we shall prove some rather surprising results. First of all, for < a < 1 
we show that the a-fillup level F n (a) is whp equal either to k n or k n + 1 where 

k n = log i re - 'y^' r 1 ( a )Vl^+ 0(1). (3) 

As a consequence, we find that if p ^ 1/2, the depth D n {a) of the a-LC is for large re 
typically about 

log log n 
-log(l-Vlog(VVM))' 
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The (full) 1-fillup level F n shown in Q should be compared to the a-fUlup level F n (a) 
presented in ©. Observe that the leading term of F n {a) is not the same as the leading 
term of F n when p ^ 1/2. Furthermore, a contributes only to the second term asymptotics. 
When comparing the typical depths D n and D n {a) we conclude that both grow like log log n 
with two constants that do not differ by much. This comparison led us to a statement in the 
abstract that the improvement of a-LC tries over the regular LC tries is rather moderate. 
We may add that for relatively slowly growing functions such as log log n the constants in 
front of them do matter (even for large values of n) and perhaps this led the authors of 
OUZIIIHI to their statements. 

The paper is organized as follows. In the next section we present our main results 
which are proved in the next two sections. We first consider a poissonized version of the 
problem for which we establish our findings. Then we show how to depoissonize our results 
completing our proof. 

2 Main Results 

Consider tries created by inserting n random strings of and 1. We will always assume 
that the strings are (potentially) infinite and that the bits in the strings are independent 
random bits, with P(l) = p and thus P(0) = q := 1 — p; moreover we assume that different 
strings are independent. 

We let Xk := ^{internal nodes filled at level k} and := X^/2 k , i.e. the proportion 
of nodes filled at level k. Note that Xk may both increase and decrease as k grows, while 

1 > x k > x k+1 > 0. 

Recall that the fillup level of the trie is defined as the last full level, i.e. max{/c : X k = 1}, 
while the height is the last level with any nodes at all, i.e. max{A; : X^ > 0}. Similarly, if 
< a < 1, the a-fillup level F n (a) is the last level where at least a proportion a of the 
nodes are filled, i.e. 

F n {ot) = maxjfc : X k > a}. 

We will in this paper study the a-fillup level for a given a with < a < 1 and a given 
p with < p < 1. 

We have the following result, where whp means with probability tending to 1 as n — > oo, 
and $ denotes the normal distribution function. Theorem ^ is proved in Section |1J after 
first considering a Poissonized version in Section |31 

Theorem 1. Let a and p be fixed with < a < 1 and < p < 1, and let F n (a) be the 
a-fillup level for the trie formed by n random strings as above. Then, for each n there is an 
integer 

K = log i n - oi l 3 y / g) ' ^(ajVWi 0(1) 

such that whp F n (a) = k n or k n + 1. Moreover, EI(. n = a + 0(l/y / Iogn) for p ^ 1/2. 

Thus the a-fillup level F n (a) is concentrated on at most two values; as in many similar 
situations (cf. [21 111! IT^l I25j ). it is easily seen from the proof that in fact for most n it is 
concentrated on a single value k n , but there are transitional regimes, close to the values of 
n where k n changes, where F n {a) takes two values with comparable probabilities. 
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Note that when p = 1/2, the second term on the right hand side disappears, and 
thus simply k n = logn + 0(1); in particular, two different values of a £ (0, 1) have their 
corresponding k n differing by O(l) only. When p ^ 1/2, changing a means shifting k n by 
G(log 1 ^ 2 n). By Theorem^ whp F n (a) is shifted by the same amounts. 

To the first order, we thus have the following simple result. 

Corollary 2. For any fixed a and p with < a < 1 and < p < \, 

F n (a) = log^_ n + O p (Vlnn); 

•Jvq 

in particular, F n (a)/log 1 y /pqTi —>■ 1 as n — > oo. 

Surprisingly enough, the leading terms of the fillup level for a = 1 and a < 1 are 
quantitatively different for p ^ 1/2. It is well known, as explained in the introduction, that 
the regular fillup level F n is concentrated on two points around logn/ log(l/p m i n ), while the 
partial fillup level F n (a) concentrates around k n ~ logn/ log (1/^/pg). Secondly, the leading 
term of F n {a) does not depend on a and the second term is proportional to ylogn, while 
for the regular fillup level F n the second term is of order log log log n. 

Theorem ^ yields several consequences for the behavior of a-LC tries. In particular, it 
implies the typical behavior of the depth, that is, the search time. Below we formulate our 
main second result concerning the depth for a-LC tries delaying the proof to Section El cf. 
© and H H2] for LC tries. 

Theorem 3. For any fixed < a < 1 and p ^ 1/2 we have 

P loglogn 
D n {a) t x- (4) 

~ lQ g ^- logd/^ j 

as n — > oo where h = —plogp — (1 — p) log(l — p) is the entropy rate of the source. 

As a direct consequence of Theorem |31 we can numerically quantify experimental results 
recently reported in |17j where a "dramatic improvement" in the search time of a-LC tries 
over the regular LC tries was observed. In a regular LC trie the search time is O(loglogn) 
with the constant in front of loglogn being l/log(l — hj log(l/p m ; n )) _1 [1]. For a-LC 
tries this constant decreases to l/log(l — h / \og{l / y/pq))~ l . While it is hardly a "dramatic 
improvement", the fact that we deal with a slowly growing leading term loglogn, may 
indeed lead to experimentally observed significant changes in the search time. 

3 Poissonization 

In this section we consider a Poissonized version of the problem, where there are Po(A) 
strings inserted in the trie. We let F\{a) denote the a-fillup level of this trie. 

Theorem 4. Let a and p be fixed with < a < 1 and < p < 1, and let F\(a) be the 
a-fillup level for the trie formed by Po(A) random strings as above. Then, for each A > 
there is an integer 

kx = log i A - 'y ^' S-^VhA + 0(1) (5) 
21n' rl// (l/^/pg) 

such that whp (as A — > oo) F\(a) = k\ or k\ + 1. 
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We shall prove Theorem 2] through a series of lemmas. Observe first that a node at level 
k can be labeled by a binary string of length k, and that the node is filled if and only if 
at least two of the inserted strings begin with this label. For r £ {0, l} k , let iVi(r) be the 
number of ones in r, and let P(r) = p Nl ^ r )q k ~ Nl ^ r ) be the probability that a random string 
begins with r. Then, in the Poissonized version, the number of inserted strings beginning 
with r £ {0, l} k has a Poisson distribution Po(AP(r)), and these numbers are independent 
for different strings r of the same length. Consequently, 

X k = T r ( 6 ) 

re{0,l} fc 

where I r are independent indicators with 

p(J r = 1) = P(Po(AP(r)) > 2) = 1 - (1 + AP(r))e- AP(r) . (7) 

Hence, 

Var (X k ) = P ^r = !)(!- p (Ir = 1)) < 2 k 

re{0,l} fc 

so Var (X k ) < 2~ fc and, by Chebyshev's inequality, 

P(|X fc -EX fc | >2- fc / 3 )^0. (8) 

Consequently, X k is sharply concentrated, and it is enough to study its expectation. (It is 
straightforward to calculate Var (Xk) more precisely, and to obtain a normal limit theorem 
for Xk, but we do not need that.) 
Assume first p > 1/2. 

Lemma 1. If p > 1/2 and 

k = log i A - - kOV g) g-i( a) VhA + 0(1), (9) 
21n' ri// (l/^/p7j) 

ffcen EX fc = a + CKAT 1 / 2 ). 

Proof. Let p = p/q > I and define 7 by \p' y q h ~' y = 1, i.e., 

" -(f) 7 



which leads to 

_ Hn(l/g) - In A 
ln(p/q) 

Let /xj = \piq k ~i = pP~^ . By ® and ©, 



(10) 



EX fc = 2- fc ^Hp(Po(^)>2). (11) 
i=o w 

If j < 7, then ^ < 1 and 



P(Po(^) > 2) < ^ < fij. 
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If j > 7, then p,j > 1 and 

1 - P(Po(/i i ) > 2) = (1 + Hj)e~^ < 2fi je ~^ < 4/iJ 1 . 
Hence (EJ yields, using (*) < (, * ,) = O^AT 1 /*), 



- VLfc/2j; 

EX, = 2~* £ (*) Ofo) + 2"* £ (*) (1 - 0(^)) 



i<7 i>7 

i>7 v/ J'=0 v/ 

= p(Bi(fc,l/2) >7) +0(£T 1 / 2 ). 

By the Berry-Esseen theorem [BJ Theorem XVI.5.1], 

'j-k/2- 

By (fTU|) and the assumption ©, 



7 = - — ; — — (A; In In A In - 

2 hx{p/q) V q 2 q 

k In In A 



\n(p/q) V ^JM 
ln(l/VM) 



ln(p/g) V" " ol /VM 
-±(ln(l/VM)r 1/2 ^V)^A + 0(1) 



(12) 



P(Bi(fc, 1/2) > 7) = 1 - ®( 1 J=-) + Oik- 1 ' 2 ). (13) 



k-\og lUm \) (14) 



= -i$" 1 (a)A; 1 / 2 + 0(l). 

This finally implies 

1 _ sfZzj^) = i _ $(-$-!(«)) + 0(/c _1 / 2 ) = a + 0(A; _1 / 2 ), 

and the lemma follows by (|12[) and (j!3|) . □ 

Lemma 2. Fix p > 1/2. For every A > 0, i/iere exists c > smc/i £/iai «/ |fc — logjy^/pq A| < 
Afc 1 / 2 , ifeen EI t - EI fc+1 > cAr 1 / 2 . 

Proof. A string r 6 {0, l} fc has two extensions rO and rl in {0, l} fc+1 . Clearly, I r Q,I r i < I ri 
and if there are exactly 2 (or 3) of the inserted strings beginning with r, then I r Q + I r \ < 
1 < 2I r . Hence 

E(2X k -X k+1 )= E(2/ r -/ r0 -/ri) > P(Po(AP(r)) = 2). (15) 

re{0,l} fc re{0,l} fc 

Let /) and 7 be as in the proof of Lemma and let j = [7] . Then pj = p 1- "' £ [1, p] and 
thus P(Po(/ij) = 2) > \e~ p . Moreover, by (fTl)) and the assumption, 

\j _ k/2 \ < MVVm) ^!^ + 1 = 0(A ,i /2) 

ln(p/g) 
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Thus, if k is large enough, we have by the standard normal approximation of the binomial 
probabilities (which follows easily from Stirling's formula, as found already by de Moivre 

0) 

2 _ k /k\ = l + o(l) p _ 20 „ fc/2)2/fc > 1/2 
\jj y/2^k/4 

for some c\ > 0. Hence, by (|15j). 

EX k - EX k+1 = 2- k - 1 E(2X k - X fc+ i) > 2^ Q P(Po(^) = 2) > ^AT 1 / 2 

as needed. □ 

Now assume p > 1/2. Starting with any k as in 0, we can by Lemmas H and El 
shift k up or down 0(1) steps and find k\ as in © such that, for a suitable c > 0, 
EX fcA > a + \ckl 1/2 > El tA+ i and EI h+2 < EI fcj+1 - ck~ 1/2 < a - ^ck~ 1/2 . It 
follows by (jSJ) that whp X kx > a and -Yfc A +2 < and hence -Fa(o) = or k\ + 1. 

This proves Theorem 0] in the case p > 1/2. The case p < 1/2 follows by symmetry, 
interchanging p and q. 

In the remaining case p = 1/2, all P(r) = 2~ k are equal. Thus, by © and (JJJ), 

El t = P(Po(A2" fe ) > 2). (16) 

Given a G (0, 1), there is a /u > such that P(Po(/x) > 2) = a. We take k\ = \}og(X/ fj,) - 
1/2J. Then, A2~ fc * > 2 1 / 2 /x and thus EX kx > a + for some a + > a. Similarly, EX kx+2 < 
a_ for some a_ < a, and the result follows in this case too. 

4 Depoissonization 

To complete the proof of Theorem^we must depoissonize the results obtained in Theorem0J 
which we do in this section. 

Proof of Theorem^ Given an integ QT ft. let be as in the proof of Theorem 0] with A = n, 
and let A± = n ± n 2/3 . Then P(Po(A_) < n)) -> 1 and P(Po(A + ) > n)) -> 1 as n -> oo. By 
monotonicity, we thus have whp i*\_(a) < F n (a) < F\ + (a), and by Theorem 0] it remains 
only to show that we can take k\_ = k\ + = k n . 

Let us now write X k (X) and X k (X), since we are working with several A. 

Lemma 3. Assume p / 1/2. Then, for every k, 

^EI t (A) = OiX^k- 1 / 2 ). 

Proof. We have 

P(Po( M ) > 2) = -f ((1 - (1 + M )e^) = 



8 



and thus, by (|lll) and the argument in 1)12(1 . 



j=0 KJ/ 



A 

j=o 

= 0(A _1 A; _1 / 2 ) 

which completes the proof. □ 



By LemmaEl |EX fe (A±) - EX k (n)\ = C^rr 1 ^- 1 /^) = (/c -1 / 2 ). Hence, by the proof 

___ _i 1/2 1 1/2 

of Theorem^ for large n, EXfc n (A±) > a + gcA; n and EI^^fAi) < a — -jcfen , and 
thus whp F\ ± (a) = k n or k n + 1. Moreover, the estimate El^ = a + 0(l/Vlogn) follows 
easily from the similar estimate for the Poisson version in Lemma ^ we omit the details. 
This completes the proof of Theorem^for p > 1/2. The case p < 1/2 is again the same by 
symmetry. The proof when p = 1/2 is similar, now using (|16|) . □ 



5 Proof of Theorem [3] 

First, let us explain heuristically our estimate for D n (a). By the Asymptotic Equipartition 
Property (cf. [25 j ) at level k n there are about n2~ hkn strings with the same prefix of length 
k n as a randomly chosen one, where h is the entropy. That is, in the corresponding branch 
of the a-LC trie, we have about n2~ hk " ~ n l ~ h l b strings (or external nodes), where for 
simplicity b = log(l/Y^9^). In the next level, we shall have about rS 1 ~ h / b ^ 1 external nodes, 
and so on. In particular, at level D n {a) we have approximately 

external nodes. Setting this = 0(1) leads to our estimate (|1J of Theorem |3J 

We now make this argument rigorous. We construct an a-LC trie from n random 
strings £1, . . . ,£ n and look at the depth D n (a) of a designated one of them. In principle, 
the designated string should be chosen at random, but by symmetry, we can assume that 
it is the first string £1. 

To construct the a-LC trie, we scan the strings £1, . . . , £ n in parallel one bit at a time, 
and build a trie level by level. As soon as the last level is filled less than a, we stop; we are 
now at level F n (a) + 1, just past the a-fillup level. The trie above this level, i.e. up to level 
F n (a), is compressed into one node, and we continue recursively with the strings attached 
to each node at level F n (a) + 1 in the uncompressed trie, i.e. the sets of strings that begin 
with the same prefixes of length F n {a) + 1. 

To find the depth D n (a) of the designated string £1 in the compressed trie, we may 
ignore all branches not containing £1; thus we let Y n be the number of the n strings that 
agree with £1 for the first F n {a) + 1 bits. Note that we have not yet inspected any later 
bits. Hence, conditioned on F n (a) and Y n , the remaining parts of these Y n strings are again 
i.i.d. random strings from the same memoryless source, so we may argue by recursion. The 
depth D n (a) equals the number of recursions needed to reduce the number of strings to 1. 

We begin by analysing a single step in the recursion. Let, for notational convenience, 
k := h/\og(\/y/pq)- Note tnat < K < 1. 
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Lemma 4. Let e > 0. Then, with probability 1 — 0(n 

InK . . 

1 - K-e < — — < 1 - n + e. 17 
Inn 

We postpone the proof of LemmaU and first use it to complete the proof of Theorem|21 
We assume below that n is large enough when needed, and that < e < min(/c, 1 — k)/2. 

We iterate, and let Zj be the number of strings remaining after j iterations; this is the 
number of strings that share the first j levels with £i in the compressed trie. We have 
Zq = n and Z\ = Y n . We stop the iteration when there are less than Inn strings remaining; 
we thus let r be the smallest integer such that Z T < Inn. In each iteration before r, Q17|) 
holds with error probability 0((lnn) _e ^ 1 )) = 0((lnlnn) -2 ). Hence, for any constant B, 
we have whp for every j < min(r, B In Inn), with k± = k db e G (0, 1), 

In Zj 

1-K+ < < 1-K-, 

in Zj—i 

or equivalently 

ln(l-K+) < lnlnZj -lnlnZj-i < ln(l - k_). (18) 
If r > t + := [lnlnn/ln(l — k_) _1 ], we find whp from (|18[) 

lnlnZ r+ < lnlnZo + r+ln(l - k_) < 0, 

so Z T+ < e < Inn, which violates r > t + . Hence, r < r + whp. 

On the other hand, if r < r_ := [(1 — e)lnlnn/ln(l — k + ) _1 J, then whp by (|18|) 

In In Z r > In In Zq + t_ ln(l — k+) > e In In n, 

which contradicts lnlnZ T < In In Inn. 

Consequently, whp r_ < r < r + ; in other words, we need (l + O(e)) iterations 

to reduce the number of strings to less than Inn. 

Iterating this result once, we see that whp at most O(lnlnlnn) further iterations are 
needed to reduce the number to less than In Inn. Finally, the remaining depth then whp is 
O(lnlnlnn) even without compression. Hence we see that whp 

1 n 1 n n 

DM = —— r(l + 0(e) +0(lnlnlnn). 

— m(l — k) 

Since e is arbitrary, Theorem |31 follows. 

It remains to prove Lemma 0J Let be the number of the strings £i, . . . , £ n that are 
equal to £i for at least their first k bits. The Y n = Wp n ( a ) + i, and thus, for any A > 0, 



P(| logy n - (1 - «)logn| > 2elogn) < V(\F n (a) - logy^n] > AVhvn) 

+ ^2 P(|logW fc -logn + /ilog 1/v ^n| > 2elogn). 

|fc-l-log 1/v ^n|<AVlnn 

Lemma thus follows from the following two lemmas, using the observation that < 

l/log(l/vM) < Vh. 

The first lemma is a large deviation estimate corresponding to Corollary [21 
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Lemma 5. For each a 6 (0, 1), there exists a constant A such that 

P(\F n (a) - \og 1/m n\ > AV^n~) = 0(l/n). 

Proof. We begin with the poissonized version, with Po(A) strings as in Section |21 Let 
k± = k±(X) := [logi/ ^/pq ^ i Ay/In AJ , and let 5 be fixed with < 6 < min(a, 1 — a). Then, 
by Lemma ^ if A is large enough, ~EX k _ > a + 5 and ~EX k+ < a — S for all large A. By a 
Chernoff bound, © can be sharpened to 

P(\X k -EX k \>5)=0(e~ e ^) 

and thus 

P(F A (a) < fc_) < P(X fc _ < a) < ¥(X k _ - EX k _ < -5) 

= 0(e- e ( 2fc -)) = O(e- ( A ° (1) )) = OiX- 1 ). 

Similarly, ¥(F x (a) > k + ) = 0(A _1 ). 

To depoissonize, let X± = n ± n 2 / 3 as in Section |I] and note that, again by a Chernoff 
estimate, P(Po(A_) < n) = 0(n _1 ) and P(Po(A + ) > n) = 0(n _1 ). Thus, with probability 
l-0(l/n), 

MA-) < ^A_(«) < F n (a) < F A+ («) < fc+(A+), 
and the result follows (if we increase A). □ 

Lemma 6. Lot < a < b < 1/h and e > 0. Then, uniformly for all k with alogra < k < 
b log n, 

P(|logW fc -logn + /c/i| >elogn) = 0(n~ e ^). (19) 

Proof. Let N± be the number of l's in the first k bits of £i. Given iVi, the distribution of 
W fc - 1 is Bi(n - l.p^V^ 1 )- 

Since p^g 9 = 2~ h , there exists 5 > such that if \N\/k — p\ < 5, then 2~ h ~ e < 
pNx/kqi-Ni/k < 2 - h+£ , and thus 

2-hk-ek < p N lq k-N! < 2 -hfc+ £ fc 5 when I jv" x /fc — j>| < <5. (20) 

Noting that /i/c < bhlogn and 6/i < 1, we see that, provided e is small enough, n2~ hk ~ ek > 
n*? for some 77 > 0, and then (|2*U)) and a Chernoff estimate yields, when \N\/k — p\ < 5, 

P(ln2~ hk - ek <W k < 2n2~ hk - £k \ JVi) = 1 - 0{e~ e ^) = 1 - C^n" 1 ), 

and thus 

P(|logW fc -log n + hk\ > ek + I | Ni) =0(n~ 1 ), when | N x /k - p\ < 5. (21) 
Moreover, N\ ~ Bi(&,p), so by another Chernoff estimate, 

P(|JVi/Jfc - P \>5)= O(e- W) = 0{n~ e ^) . 
The result follows (possibly changing e) from this and (|2*T|) . □ 
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